BACKGROUND OF THE INVENTION

Modern psychiatry typically subdivides mood disorders into bipolar disorders
(episodes of mania or both mania and depression) and unipolar depressive disorder
(episodes of depression). Symptoms of mania include expansive, elevated or irritable
mood, inflated self-esteem, grandiosity, decreased need for sleep, increased
talkativeness, racing thoughts, distractibility, increased goal-directed activity, and
excessive involvement in pleasurable activities with a high potential for painful
consequences. Depressive symptoms include depressed mood, diminished interest or
pleasure in activities, insomnia or hypersomnia, psychomotor agitation or retardation,
fatigue or loss of energy, feelings of worthlessness, excessive guilt, inability to
concentrate or act decisively, and recurrent thoughts of death or suicide. Several mental
disorders have been proposed as alternate expressions of a bipolar genotype, including
variants of schizoaffective disorder, recurrent unipolar depression and hypomania
(bipolar II disorder).



Neuropsychiatric disorders, such as schizophrenia, attention deficit disorders,
schizoaffective disorders, bipolar disorders and unipolar disorders, differ from
neurological disorders in that anatomical or biochemical pathologies are readily
detectable for the latter but not the former. Largely as a result of this difference, drugs
which have been used to treat individuals with neuropsychiatric disorders, including
lithium salts, valproic acid and carbamazepine, have not been predictably effective in
treatment regimens across a variety of patients. Treatment regimens are further
complicated by the fact that clinical diagnosis currently relies on clinical observation and
subjective reports. Identification of the anatomical or biochemical defects which result
in neuropsychiatric disorders is needed in order to effectively distinguish between the
disorders and to allow the design and administration of effective therapeutics for these
disorders.

SUMMARY OF THE INVENTION 

As described herein, polymorphisms in the gene for brain-derived neurotrophic
factor (BDNF) have been discovered, and at least one of the polymorphisms is correlated
with incidence of neuropsychiatric disorders (e.g., bipolar disorder). A polymorphism at
nucleotide position 31 in human brain-derived neurotrophic factor (as numbered in SEQ
ID NO: 1, GenBank Accession No: M61181) has been discovered in which the
reference "T" (thymine) is changed to "A" (adenine).

Furthermore, a single nucleotide polymorphism has been discovered within the
nucleotide sequence encoding the 128 amino acid prepro portion of the BDNF gene
product which is correlated with reduced incidence of bipolar disorder in a sample
population assessed as described herein. In one embodiment, a single nucleotide
polymorphism from "G" to "A" at nucleotide position 858 (as numbered in SEQ ID NO:
1), resulting in an amino acid change from valine to methionine at amino acid position
-63 (relative to the start of the mature protein), is correlated with a reduced incidence of
bipolar disorder in the sample population assessed as described herein. That is, it has
been determined that there is a variation from random (i.e., that which would be
expected by chance) in the transmission of the reference "G" (guanine) and variant "A"
(adenine) at position 858 from a parent who is heterozygous for the BDNF alleles to an
offspring diagnosed with bipolar disorder. It appears that this variant allele of the SNP
in the prepro region of BDNF may contribute to protection or reduction in
symptomology with respect to bipolar disorder. Alternatively, this particular
polymorphism may be one of a group of two or more polymorphisms in the BDNF gene
which contributes to the presence, absence or severity of the neuropsychiatric disorder,
e.g., bipolar disorder.

The invention relates to methods for diagnosing and treating neuropsychiatric 
disorders, especially bipolar disorder, and to methods for identifying compounds for use 
in the diagnosis and treatment of neuropsychiatric disorders. The invention relates to
novel compounds and pharmaceutical compositions for use in the diagnosis and
treatment of neuropsychiatric disorders. The invention further relates to kits for use in
diagnosing neuropsychiatric disorders. In a preferred embodiment, the neuropsychiatric
disorder is bipolar disorder. 

In one embodiment, the invention relates to a method for predicting the
likelihood that an individual will have a neuropsychiatric disorder (or aiding in the
diagnosis of a neuropsychiatric disorder), e.g., bipolar disorder, comprising the steps of
obtaining a DNA sample from an individual to be assessed and determining the
nucleotide present at nucleotide position 31 of the BDNF gene, as numbered in SEQ ID
NO: 1. The presence of a "T" at position 31 indicates that the individual has a lower
likelihood of being diagnosed with a neuropsychiatric disorder than an individual having
an "A" at that position. In a preferred embodiment, the neuropsychiatric disorder is
bipolar disorder. In a particular embodiment, the individual is an individual at risk for
development of bipolar disorder.

The method comprises obtaining a DNA sample from an individual to be 
assessed. The DNA sample comprises a polynucleotide sequence of the BDNF gene or 
portion thereof comprising position 31 of SEQ ID NO: 1. The nucleotide present at 
position 31 of said polynucleotide sequence is determined. The identity of the nucleotide
at position 31 can be determined by nucleic acid detection methods well known in the
art.

In another embodiment, the invention is drawn to a method of predicting the 
likelihood that an individual will have reduced symptomology associated with a 
neuropsychiatric disorder, comprising the steps of obtaining a DNA sample from an 
individual to be assessed and determining the nucleotide present at nucleotide position 
31 of the BDNF gene, as numbered in SEQ ID NO: 1. The presence of a "T" at
position 31 indicates that the individual will have reduced symptomology associated 
with a neuropsychiatric disorder. 

In another embodiment, the presence of an "A" at position 31, as numbered in
SEQ ID NO: 1, indicates that a person will have an increased likelihood of being
diagnosed with a neuropsychiatric disorder as compared with an individual having a "T"
at the position.

The invention also relates to a kit for determining the genotype of a nucleotide 
corresponding to position 31 of SEQ ID NO: 1 in a polynucleotide sequence of interest. 
The kit comprises one or more nucleic acid probes, wherein one of said probes 
hybridizes to the polynucleotide sequence of interest, wherein the polynucleotide 
sequence of interest comprises the BDNF gene, its complement, or portion thereof and 
wherein the polynucleotide sequence of interest includes a nucleotide corresponding to 
position 31 of SEQ ID NO: 1. The kit can also comprise control nucleic acid samples
representing the genotype of at least one of the group consisting of: an individual
homozygous for an "A" at nucleotide position 31 of a BDNF gene, an individual
homozygous for a "T" at nucleotide position 31 of a BDNF gene, and an individual
heterozygous for said position, wherein position 31 corresponds to position 31 of SEQ
ID NO: 1. The kits of the present invention are particularly suited for use in the method
of the present invention, e.g., for predicting the likelihood that an individual will have or
be diagnosed with a neuropsychiatric disorder, such as a bipolar disorder. In one
embodiment, the kit comprises an SBE-FRET primer, wherein said primer hybridizes to
a polynucleotide sequence comprising position 31 of SEQ ID NO: 1. In one
embodiment, the polynucleotide sequence of interest is at least about 10 nucleotides in
length. In another embodiment, the polynucleotide sequence is at least about 20
nucleotides in length.

In still another embodiment, the invention relates to a microarray, wherein the
microarray has immobilized thereon a plurality of probes, wherein at least one of said
probes is specific for the variant form of the single nucleotide polymorphism at position
31 of SEQ ID NO: 1. In another embodiment, at least one of the probes is specific for
the reference form of the single nucleotide polymorphism at position 31 of SEQ ID NO:
1.

The invention also relates to a nucleic acid molecule, wherein the nucleic acid
molecule comprises a nucleic acid sequence which is at least 10 nucleotides in length.
Said nucleic acid molecule includes a nucleotide corresponding to position 31 of SEQ ID
NO: 1 or its complement wherein said nucleotide at position 31 of SEQ ID NO: 1 is an
"A."

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows the polypeptide and polynucleotide sequence of BDNF, GenBank
Accession M61181, SEQ ID NOs: 2 and 1, respectively.

Figure 2 shows the estimated relative risk of developing bipolar disorder based
on data and analyses from the indicated groups of affected individuals and described in
the Exemplification.



DETAILED DESCRIPTION OF THE INVENTION 

The development and maintenance of the vertebrate nervous system depends, in
part, on the physiological availability of neuronal survival proteins known as
neurotrophic factors. Neurotrophic factors play a role in maintaining neurons and their
differentiated phenotypes in the adult nervous system. Nerve growth factor (NGF)
remains the best characterized neurotrophic factor. However, brain-derived neurotrophic
factor (BDNF) has been cloned and shown to be homologous to NGF (Leibrock et al.,
Nature 341:149-152 (1989); Hofer et al., EMBO J. 9:2459-2464 (1990); Maisonpierre et
al., Genomics 10:558-568 (1991)). BDNF is initially synthesized as a 247 amino acid
protein precursor that is subsequently cleaved to yield the mature protein. The mature
form of BDNF essentially corresponds to the C-terminal half of its precursor and
comprises 119 amino acids. In the developing rat, BDNF expression undergoes an
increase from initially low levels, and in the adult rat central nervous system, BDNF is
expressed at its highest level in the hippocampus. Expression of BDNF is detectable in
adult tissues outside of the central nervous system only in heart, lung and skeletal muscle
(Maisonpierre et al., Science 247:1446-1451 (1990); Hofer et al., EMBO J. 9:2459-2464
(1990)).

As used herein, polymorphism refers to the occurrence of two or more
genetically determined alternative sequences or alleles in a population. A polymorphic
marker or site is the locus at which divergence occurs. Preferred markers have at least
two alleles, each occurring at a frequency of greater than 1%, and more preferably greater
than 10% or 20% of a selected population. A polymorphic locus may be as small as one
base pair, in which case it is referred to as a single nucleotide polymorphism.
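
By way of illustration only, the frequency criterion above can be sketched in Python. The function names, the genotype encoding (a two-letter string per individual) and the sample data are assumptions made for this sketch and are not taken from the study population described herein.

```python
from collections import Counter

def allele_frequencies(genotypes):
    """Return {allele: frequency} from diploid genotypes written as two-letter strings."""
    counts = Counter(allele for genotype in genotypes for allele in genotype)
    total = sum(counts.values())
    return {allele: n / total for allele, n in counts.items()}

def is_polymorphic(freqs, threshold=0.01):
    """True if at least two alleles each occur at a frequency above the threshold."""
    return sum(1 for f in freqs.values() if f > threshold) >= 2

# Hypothetical sample of ten individuals typed at one site (illustrative data only).
sample = ["GG", "GA", "GG", "AA", "GA", "GG", "GG", "GA", "GG", "GG"]
freqs = allele_frequencies(sample)
print(freqs, is_polymorphic(freqs, threshold=0.20))
```

The threshold argument corresponds to the 1%, 10% or 20% cut-offs mentioned above; which cut-off is appropriate depends on the population selected.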

As described herein, polymorphisms in the gene for BDNF have been
discovered. In one embodiment, a single polymorphism from T to A at nucleotide
position 31 in the BDNF gene, as numbered in SEQ ID NO: 1, or at a nucleotide
position corresponding thereto, has been discovered. It has also been discovered that
one or more single nucleotide polymorphisms within the nucleotide sequence encoding
the amino acid prepro portion of the BDNF gene product are correlated with a reduced
incidence of bipolar disorder in the sample population assessed as described herein. For
example, a single polymorphism from G to A at nucleotide position 858 of SEQ ID NO:
1, resulting in an amino acid change from valine to methionine at amino acid position
-63 (relative to the start of the mature protein), or at an amino acid position corresponding
thereto, is correlated with a reduced incidence of bipolar disorder in the sample
population assessed as described herein. This polymorphism resides within the amino
acid precursor portion (the prepro portion) which is cleaved from the mature protein.



It appears that the variant allele at position 858 of BDNF may contribute to
protection or reduction in symptomology with respect to bipolar disorder. Alternatively,
this particular polymorphism may be one of a group of two or more polymorphisms in
the BDNF gene which contributes to the presence, absence or severity of the
neuropsychiatric disorder, e.g., bipolar disorder. Therefore, because of the linkage
disequilibrium, the transmission of a "T" at position 31 as numbered in SEQ ID NO: 1
is linked to transmission of "A" at position 858.

Variation at nucleotide 31 of SEQ ID NO: 1 is not in the coding region of the
BDNF gene. Therefore, in terms of a phenotypic effect on the BDNF protein, variation
at position 31 is silent. However, because blocks of the genome are consistently
inherited together in a population (linkage disequilibrium), nearby linked SNPs (whether
silent or not) may also reveal an association to an underlying causative SNP. As
described herein, the presence of "T" at position 31 of the BDNF gene (as numbered in
SEQ ID NO: 1) is in complete linkage disequilibrium with a variation at position 858.
Position 858, in turn, shows a variation from random in the transmission of the reference
"G" and variant "A" alleles from an individual parent who is heterozygous for the BDNF
alleles to an offspring diagnosed with bipolar disorder. As described herein, the
transmission of "A" at position 858 is associated with a reduced incidence of bipolar
disorder. Therefore, because of the linkage disequilibrium, the transmission of "T" at
position 31 as numbered in SEQ ID NO: 1 is linked to transmission of "A" at position
858.
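
As a non-limiting sketch of how linkage disequilibrium between two biallelic sites may be quantified, the following Python function computes the standard coefficients D, D' and r^2 from haplotype counts. The haplotype counts shown are hypothetical and chosen only so that the two sites are in complete disequilibrium; they are not data from the population assessed herein, and the function name is an assumption of this sketch.

```python
def linkage_disequilibrium(hap_counts, allele_1, allele_2):
    """Compute (D, D', r^2) for two biallelic sites.

    hap_counts maps two-letter haplotypes (site 1 allele followed by site 2
    allele) to counts; allele_1 and allele_2 are the alleles tracked at each site.
    """
    total = sum(hap_counts.values())
    p_a = sum(n for h, n in hap_counts.items() if h[0] == allele_1) / total
    p_b = sum(n for h, n in hap_counts.items() if h[1] == allele_2) / total
    p_ab = hap_counts.get(allele_1 + allele_2, 0) / total
    d = p_ab - p_a * p_b
    # Largest |D| attainable given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else 0.0
    return d, d_prime, r2

# Hypothetical haplotypes: first letter is position 31, second is position 858.
counts = {"TA": 20, "AG": 80}
d, d_prime, r2 = linkage_disequilibrium(counts, "T", "A")
print(round(d_prime, 6), round(r2, 6))
```

With these illustrative counts every "T" at the first site travels with "A" at the second site, which is the situation in which genotyping either site is informative about the other.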

Therefore, while not wishing to be bound by theory, variation at position 31 can
aid in the diagnosis or prognosis of a neuropsychiatric disorder, such as bipolar disorder,
due to linkage disequilibrium of position 31 with position 858. Determination of the
identity of nucleotide 31 of the BDNF gene, as numbered in SEQ ID NO: 1, can be used
to aid in the formulation of a diagnosis or prognosis of neuropsychiatric disease, such as
bipolar disorder. Furthermore, as more allelic variations are discovered in genes
involved in neuropsychiatric disorders, the ability to assess the genotype of an individual
at one or more loci could facilitate the formulation of a diagnosis or prognosis of
neuropsychiatric diseases, such as bipolar disorder.

Thus, the invention relates to a method for predicting the likelihood that an
individual will have a neuropsychiatric disorder, or for aiding in the diagnosis of a
neuropsychiatric disorder, e.g., bipolar disorder, or a greater likelihood of having
reduced symptomology associated with a neuropsychiatric disorder, e.g., bipolar
disorder, comprising the steps of obtaining a DNA sample from an individual to be
assessed and determining the nucleotide genotype at nucleotide position 31 of the BDNF
gene as numbered in SEQ ID NO: 1. In one embodiment, the nucleotide present at
position 31 is identified. In a preferred embodiment, the neuropsychiatric disorder is
bipolar disorder. In a particular embodiment, the individual is an individual at risk for
development of bipolar disorder. In another embodiment the individual exhibits clinical
symptomology associated with bipolar disorder. In one embodiment, the individual has
been clinically diagnosed as having bipolar disorder.
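
For illustration, the genotype determination step can be sketched as follows. The function names and the strand argument are hypothetical conveniences of this sketch; the sketch only normalizes two observed alleles to the strand of SEQ ID NO: 1 and counts "T" alleles, and does not itself assign any clinical meaning to a genotype.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def call_position_31(allele_1, allele_2, strand="+"):
    """Return the position 31 genotype on the SEQ ID NO: 1 strand, e.g. "A/T".

    strand="-" means the alleles were read on the complementary strand and are
    converted to their complementary bases before the genotype is reported.
    """
    alleles = [allele_1.upper(), allele_2.upper()]
    if strand == "-":
        alleles = [COMPLEMENT[a] for a in alleles]
    if any(a not in ("A", "T") for a in alleles):
        raise ValueError("expected the reference T or the variant A at position 31")
    return "/".join(sorted(alleles))

def t_allele_count(genotype):
    """Number of reference "T" alleles (0, 1 or 2) in a genotype such as "A/T"."""
    return genotype.split("/").count("T")

print(call_position_31("T", "A"), t_allele_count(call_position_31("T", "A")))
```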

As used herein, "position 31" and "position 858" refer to nucleotide positions of
the BDNF gene corresponding to positions 31 and 858, respectively, of SEQ ID NO: 1,
or the complement thereof. When referring to the complementary strand, it is
understood that the complementary base of the indicated nucleotide is the nucleotide of
interest. The nucleotide positions of the polymorphisms can be referred to in a number of
different ways. For convenience, the following table provides a cross-reference between two
common numbering schemes: the numbering based on GenBank sequence M61181
(SEQ ID NO: 1), and numbering based on the starting codon of the prepro protein
(where the first nucleotide in the starting ATG codon is "1" and, for example, the
nucleotide upstream of "1" is "-1").



TABLE I

Nucleotide Position of        Nucleotide Position Relative to First
SEQ ID NO: 1                  Position of Starting ATG (1)

31                            -633
858                           196

(1) Position 663 of SEQ ID NO: 1.



The genetic material to be assessed can be obtained from any nucleated cell from
the individual. For assay of genomic DNA, virtually any biological sample (other than
pure red blood cells) is suitable. For example, convenient tissue samples include whole
blood, semen, saliva, tears, urine, fecal material, sweat, skin and hair. For assay of
cDNA or mRNA, the tissue sample must be obtained from an organ in which the target
nucleic acid is expressed. For example, cells from the central nervous system (such as
cells of the hippocampus), neural crest-derived cells, skin, heart, lung and skeletal
muscle are suitable sources for obtaining cDNA for the BDNF gene. Neural
crest-derived cells include, for example, melanocytes and keratinocytes.

Many of the methods described herein require amplification of DNA from target
samples. This can be accomplished by, e.g., PCR. See generally PCR Technology:
Principles and Applications for DNA Amplification (ed. H.A. Erlich, Freeman Press,
NY, NY, 1992); PCR Protocols: A Guide to Methods and Applications (eds. Innis, et al.,
Academic Press, San Diego, CA, 1990); Mattila et al., Nucleic Acids Res. 19, 4967
(1991); Eckert et al., PCR Methods and Applications 1, 17 (1991); PCR (eds.
McPherson et al., IRL Press, Oxford); and U.S. Patent 4,683,202.

Other suitable amplification methods include the ligase chain reaction (LCR)
(see Wu and Wallace, Genomics 4, 560 (1989), Landegren et al., Science 241, 1077
(1988)), transcription amplification (Kwoh et al., Proc. Natl. Acad. Sci. USA 86, 1173
(1989)), and self-sustained sequence replication (Guatelli et al., Proc. Nat. Acad. Sci.
USA, 87, 1874 (1990)) and nucleic acid based sequence amplification (NASBA). The
latter two amplification methods involve isothermal reactions based on isothermal
transcription, which produce both single stranded RNA (ssRNA) and double stranded
DNA (dsDNA) as the amplification products in a ratio of about 30 or 100 to 1,
respectively.

The nucleotide which occupies the polymorphic site of interest (e.g., nucleotide
position 31 in BDNF as numbered in SEQ ID NO: 1) can be identified by a variety of
methods, such as Southern analysis of genomic DNA; direct mutation analysis by
restriction enzyme digestion; Northern analysis of RNA; denaturing high pressure liquid
chromatography (DHPLC); gene isolation and sequencing; hybridization of an
allele-specific oligonucleotide with amplified gene products; single base extension (SBE); or
analysis of the BDNF protein. In a preferred embodiment, determination of the allelic
form of BDNF is carried out using SBE-FRET methods as described in the examples, or
using chip-based oligonucleotide arrays. A sampling of suitable procedures is
discussed below in turn.

1. Allele-Specific Probes 

The design and use of allele-specific probes for analyzing polymorphisms is
described by, e.g., Saiki et al., Nature 324, 163-166 (1986); Dattagupta, EP 235,726;
Saiki, WO 89/11548. Allele-specific probes can be designed that hybridize to a segment
of target DNA from one individual but do not hybridize to the corresponding segment
from another individual due to the presence of different polymorphic forms in the
respective segments from the two individuals. Hybridization conditions should be
sufficiently stringent that there is a significant difference in hybridization intensity
between alleles, and preferably an essentially binary response, whereby a probe
hybridizes to only one of the alleles. Hybridizations are usually performed under
stringent conditions, for example, at a salt concentration of no more than 1 M and a
temperature of at least 25°C. For example, conditions of 5X SSPE (750 mM NaCl,
50 mM NaPhosphate, 5 mM EDTA, pH 7.4) and a temperature of 25-30°C, or
equivalent conditions, are suitable for allele-specific probe hybridizations. Equivalent
conditions can be determined by varying one or more of the parameters given as an
example, as known in the art, while maintaining a similar degree of identity or similarity
between the target nucleotide sequence and the primer or probe used.

Some probes are designed to hybridize to a segment of target DNA such that the
polymorphic site aligns with a central position (e.g., in a 15-mer at the 7 position; in a
16-mer, at either the 8 or 9 position) of the probe. This design of probe achieves good
discrimination in hybridization between different allelic forms.
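
The placement of the polymorphic site at a central probe position can be sketched as follows. This is a minimal illustration only: the flanking sequences are made-up placeholders rather than BDNF sequence, and the function name and parameters are assumptions of this sketch.

```python
def allele_specific_probes(flank_5, ref, var, flank_3, length=15, site_pos=7):
    """Return (reference_probe, variant_probe) of the given length.

    site_pos is the 1-based probe position occupied by the polymorphic base,
    e.g. position 7 of a 15-mer as in the example given above.
    """
    left = site_pos - 1          # bases taken from the 5' flank
    right = length - site_pos    # bases taken from the 3' flank
    if len(flank_5) < left or len(flank_3) < right:
        raise ValueError("flanking sequence too short for the requested probe")
    upstream = flank_5[len(flank_5) - left:]
    downstream = flank_3[:right]
    return upstream + ref + downstream, upstream + var + downstream

# Placeholder flanking sequences (hypothetical, not taken from SEQ ID NO: 1).
ref_probe, var_probe = allele_specific_probes(
    "GATTACAGGCTA", "T", "A", "CCGTAGGCTTAA")
print(ref_probe)
print(var_probe)
```

The two probes of the resulting pair differ only at the polymorphic position, so that one member perfectly matches the reference form and the other the variant form.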

Allele-specific probes are often used in pairs, one member of a pair showing a
perfect match to a reference form of a target sequence and the other member showing a
perfect match to a variant form. Several pairs of probes can then be immobilized on the
same support for simultaneous analysis of multiple polymorphisms within the same
target sequence. Allele-specific probes can comprise DNA, peptide nucleic acid (PNA)
and RNA, or combinations thereof.

2. Tiling Arrays 

The polymorphisms can also be identified by hybridization to nucleic acid arrays, 
some examples of which are described in WO 95/11995. WO 95/11995 also describes 
subarrays that are optimized for detection of a variant form of a precharacterized 
polymorphism. Such a subarray contains probes designed to be complementary to a 
second reference sequence, which is an allelic variant of the first reference sequence. 
The second group of probes is designed by the same principles, except that the probes 
exhibit complementarity to the second reference sequence. The inclusion of a second
group (or further groups) can be particularly useful for analyzing short subsequences of
the primary reference sequence in which multiple mutations are expected to occur within 
a short distance commensurate with the length of the probes (e.g., two or more mutations 
within 9 to 21 bases). 

3. Allele-Specific Primers 

An allele-specific primer hybridizes to a site on target DNA overlapping a
polymorphism and only primes amplification of an allelic form to which the primer
exhibits perfect complementarity. See Gibbs, Nucleic Acid Res. 17, 2427-2448 (1989).
This primer is used in conjunction with a second primer which hybridizes at a distal site.
Amplification proceeds from the two primers, resulting in a detectable product which
indicates the particular allelic form is present. A control is usually performed with a
second pair of primers, one of which shows a single base mismatch at the polymorphic
site and the other of which exhibits perfect complementarity to a distal site. The
single-base mismatch prevents amplification and no detectable product is formed. The method
works best when the mismatch is included in the 3'-most position of the oligonucleotide
aligned with the polymorphism because this position is most destabilizing to elongation
from the primer (see, e.g., WO 93/22456).

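
As a simplified, non-limiting sketch, primers whose 3'-most base is aligned with the polymorphism can be generated as shown below. The flanking sequence is a hypothetical placeholder, the primers are written in the same sense as the target strand for simplicity, and the match test is a crude model of the 3' mismatch effect rather than a thermodynamic prediction.

```python
def allele_specific_primers(flank_5, alleles, length=20):
    """Return {allele: primer}, each primer ending (3'-most base) on the polymorphic site."""
    if len(flank_5) < length - 1:
        raise ValueError("5' flanking sequence too short for the requested primer")
    upstream = flank_5[len(flank_5) - (length - 1):]
    return {allele: upstream + allele for allele in alleles}

def three_prime_base_matches(primer, sample_allele):
    """Crude model: extension is expected only when the 3'-most base matches the sample."""
    return primer[-1] == sample_allele

# Placeholder 5' flank (hypothetical, not taken from SEQ ID NO: 1).
primers = allele_specific_primers("GATTACAGGCTAACGTTGCAAGGCTA", ["T", "A"])
for allele, primer in primers.items():
    print(allele, primer, three_prime_base_matches(primer, "T"))
```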
4. Direct-Sequencing

The direct analysis of the sequence of polymorphisms of the present invention
can be accomplished using either the dideoxy chain termination method or the
Maxam-Gilbert method (see Sambrook et al., Molecular Cloning, A Laboratory Manual (2nd
Ed., CSHP, New York 1989); Zyskind et al., Recombinant DNA Laboratory Manual,
(Acad. Press, 1988)).



5. Denaturing Gradient Gel Electrophoresis 

Amplification products generated using the polymerase chain reaction can be
analyzed by the use of denaturing gradient gel electrophoresis. Different alleles can be
identified based on the different sequence-dependent melting properties and
electrophoretic migration of DNA in solution. Erlich, ed., PCR Technology, Principles
and Applications for DNA Amplification, (W.H. Freeman and Co, New York, 1992),
Chapter 7.



6. Single-Strand Conformation Polymorphism Analysis 

Alleles of target sequences can be differentiated using single-strand conformation
polymorphism analysis, which identifies base differences by alteration in electrophoretic
migration of single stranded PCR products, as described in Orita et al., Proc. Nat. Acad.
Sci. 86, 2766-2770 (1989). Amplified PCR products can be generated as described
above, and heated or otherwise denatured, to form single stranded amplification
products. Single-stranded nucleic acids may refold or form secondary structures which
are partially dependent on the base sequence. The different electrophoretic mobilities of
single-stranded amplification products can be related to base-sequence differences
between alleles of target sequences.



7. Single-Base Extension 

An alternative method for identifying and analyzing polymorphisms is based on
single-base extension (SBE) of a fluorescently-labeled primer coupled with fluorescence
resonance energy transfer (FRET) between the label of the added base and the label of
the primer. Typically, the method, such as that described by Chen et al. (PNAS
94:10756-61 (1997), incorporated herein by reference), uses a locus-specific
oligonucleotide primer labeled on the 5' terminus with 5-carboxyfluorescein (FAM).
This labeled primer is designed so that the 3' end is immediately adjacent to the
polymorphic site of interest. The labeled primer is hybridized to the locus, and single
base extension of the labeled primer is performed with fluorescently labeled
dideoxyribonucleotides (ddNTPs) in dye-terminator sequencing fashion, except that no
deoxyribonucleotides are present. An increase in fluorescence of the added ddNTP in
response to excitation at the wavelength of the labeled primer is used to infer the identity
of the added nucleotide.
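
The inference step of the SBE-FRET readout can be sketched, for illustration only, as a threshold call on the fluorescence increase observed for each labeled ddNTP. The signal values, the threshold and the function names below are hypothetical and are not parameters of the Chen et al. method; the primer helper assumes a primer written in the same sense as the flanking sequence supplied.

```python
def sbe_primer(flank_5, length=20):
    """Primer whose 3' end lies immediately adjacent to (just 5' of) the polymorphic site."""
    if len(flank_5) < length:
        raise ValueError("5' flanking sequence too short for the requested primer")
    return flank_5[len(flank_5) - length:]

def call_genotype(fret_signal, threshold):
    """Infer a genotype from the fluorescence increase recorded for each labeled ddNTP.

    fret_signal maps a base to its signal increase; one base above threshold is
    called homozygous, two bases heterozygous, anything else is a no-call (None).
    """
    incorporated = sorted(b for b, s in fret_signal.items() if s > threshold)
    if len(incorporated) == 1:
        return incorporated[0] + "/" + incorporated[0]
    if len(incorporated) == 2:
        return "/".join(incorporated)
    return None

# Hypothetical, arbitrary-unit signal increases for three samples.
print(call_genotype({"A": 0.05, "T": 0.90}, threshold=0.30))
print(call_genotype({"A": 0.70, "T": 0.80}, threshold=0.30))
print(call_genotype({"A": 0.10, "T": 0.10}, threshold=0.30))
```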

The polymorphisms of the invention may contribute to the protection of an
individual against bipolar disorder in different ways. The polymorphisms may
contribute to phenotype by affecting protein structure. By altering amino acid sequence,
the polymorphism may alter the function of the encoded protein. The polymorphisms
may exert phenotypic effects indirectly via influence on replication, transcription, and
translation. For example, the substitution of a methionine for a valine in the prepro
portion of the BDNF gene product may create an alternative translation start site which
alters the length of the gene product and the prepro portion itself. Alteration of the
length of the gene product may affect cleavage of the mature protein either positively or
negatively. Alternatively, the presence of the variant amino acid may alter the properties
of the gene product so as to alter cleavage of the gene product. More than one
phenotypic trait may be affected. For example, other neuropsychiatric disorders which
are believed to be alternate expressions of a bipolar genotype, including variants of
schizoaffective disorder, recurrent unipolar depression and hypomania (bipolar II
disorder), may also be affected by the BDNF polymorphisms described herein.
Additionally, the described polymorphisms may predispose an individual to a distinct
mutation that is causally related to a certain phenotype, such as susceptibility or
resistance to bipolar disorder. The discovery of the polymorphisms and correlation with
bipolar disorder facilitates biochemical analysis of the variant and the development of
assays to characterize the variant and to screen for pharmaceuticals that interact directly
with one or another form of the protein.

Alternatively, the polymorphisms may be one of a group of two or more 
polymorphisms in the BDNF gene which contributes to the presence, absence or severity 
of the neuropsychiatric disorder, e.g., bipolar disorder. An assessment of other
polymorphisms within the BDNF gene can be undertaken, and the separate and 
combined effects of these polymorphisms on the neuropsychiatric disorder phenotype 
can be assessed. 

Correlation between a particular phenotype, e.g., the bipolar phenotype, and the
presence or absence of a particular allele is performed for a population of individuals
who have been tested for the presence or absence of the phenotype. Correlation can be
performed by standard statistical methods such as a Chi-squared test and statistically
significant correlations between polymorphic form(s) and phenotypic characteristics are
noted. For example, as described herein, it has been found that the presence of the
BDNF variant allele, having an A at polymorphic site 858 (as numbered in SEQ ID NO:
1), correlates negatively with bipolar disorder with a p value of p=0.004 by Chi-squared
test.
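
By way of illustration, a Chi-squared test with one degree of freedom can be sketched in Python using only the standard library. The first function is the ordinary Pearson statistic for a 2x2 table; the second is a transmission-based statistic of the kind suggested by the comparison of transmitted and untransmitted alleles from heterozygous parents, and its use here is an assumption of this sketch, since the exact statistic is not specified in this passage. All counts are hypothetical and do not reproduce the p=0.004 result reported above.

```python
import math

def p_value_1df(chi2):
    """Upper-tail probability of a Chi-squared variable with one degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2))

def chi_squared_2x2(a, b, c, d):
    """Pearson Chi-squared (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    return chi2, p_value_1df(chi2)

def transmission_test(transmitted, untransmitted):
    """Test departure from 50:50 transmission of one allele by heterozygous parents."""
    chi2 = (transmitted - untransmitted) ** 2 / (transmitted + untransmitted)
    return chi2, p_value_1df(chi2)

# Hypothetical counts: allele carriers vs non-carriers among cases and controls.
print(chi_squared_2x2(10, 20, 30, 40))
# Hypothetical counts: allele transmitted 30 times and not transmitted 10 times.
print(transmission_test(30, 10))
```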

This correlation can be exploited in several ways. In the case of a strong
correlation between a particular polymorphic form, e.g., the reference allele for BDNF,
and a disease for which treatment is available, e.g., bipolar disorder, detection of the
polymorphic form in an individual may justify immediate administration of treatment, or
at least the institution of regular monitoring of the individual. Detection of a
polymorphic form correlated with a disorder in a couple contemplating a family may
also be valuable to the couple in their reproductive decisions. For example, the female
partner might elect to undergo in vitro fertilization to avoid the possibility of
transmitting such a polymorphism from her husband to her offspring. In the case of a
weaker, but still statistically significant correlation between a polymorphic form and a
particular disorder, immediate therapeutic intervention or monitoring may not be
justified. Nevertheless, the individual can be motivated to begin simple life-style
changes (e.g., therapy or counseling) that can be accomplished at little cost to the
individual but confer potential benefits in reducing the risk of conditions to which the
individual may have increased susceptibility by virtue of the particular allele.
Furthermore, identification of a polymorphic form correlated with enhanced
receptiveness to one of several treatment regimes for a disorder indicates that this
treatment regime should be followed for the individual in question.

Furthermore, it may be possible to identify a physical linkage between a genetic
locus associated with a trait of interest (e.g., bipolar disorder) and polymorphic markers
that are not associated with the trait, but are in physical proximity with the genetic locus
responsible for the trait and co-segregate with it. Such analysis is useful for mapping a
genetic locus associated with a phenotypic trait to a chromosomal position, and thereby
cloning gene(s) responsible for the trait. See Lander et al., Proc. Natl. Acad. Sci. (USA)
83, 7353-7357 (1986); Lander et al., Proc. Natl. Acad. Sci. (USA) 84, 2363-2367 (1987);
Donis-Keller et al., Cell 51, 319-337 (1987); Lander et al., Genetics 121, 185-199
(1989). Genes localized by linkage can be cloned by a process known as directional
cloning. See Wainwright, Med. J. Australia 159, 170-174 (1993); Collins, Nature
Genetics 1, 3-6 (1992).

Linkage studies are typically performed on members of a family. Available
members of the family are characterized for the presence or absence of a phenotypic trait
and for a set of polymorphic markers. The distribution of polymorphic markers in an
informative meiosis is then analyzed to determine which polymorphic markers co-segregate
with a phenotypic trait. See, e.g., Kerem et al., Science 245, 1073-1080
(1989); Monaco et al., Nature 316, 842 (1985); Yamoka et al., Neurology 40, 222-226
(1990); Rossiter et al., FASEB Journal 5, 21-27 (1991).

Linkage is analyzed by calculation of LOD (log of the odds) values. A LOD 
value is the relative likelihood of obtaining observed segregation data for a marker and a
genetic locus when the two are located at a recombination fraction θ, versus the situation
in which the two are not linked, and thus segregating independently (Thompson &
Thompson, Genetics in Medicine (5th ed., W.B. Saunders Company, Philadelphia, 1991);
Strachan, "Mapping the human genome" in The Human Genome (BIOS Scientific
Publishers Ltd, Oxford), Chapter 4). A series of likelihood ratios is calculated at
various recombination fractions (θ), ranging from θ = 0.0 (coincident loci) to θ = 0.50
(unlinked). Thus, the likelihood ratio at a given value of θ is the ratio of the probability
of the data if the loci are linked at θ to the probability of the data if the loci are
unlinked. The computed likelihoods are usually
expressed as the log10 of this ratio (i.e., a LOD score). For example, a LOD score of 3
indicates 1000:1 odds against an apparent observed linkage being a coincidence. The
use of logarithms allows data collected from different families to be combined by simple
addition. Computer programs are available for the calculation of LOD scores for
differing values of θ (e.g., LIPED, MLINK (Lathrop, Proc. Natl. Acad. Sci. (USA) 81,
3443-3446 (1984))). For any particular LOD score, a recombination fraction may be
determined from mathematical tables. See Smith et al., Mathematical tables for
research workers in human genetics (Churchill, London, 1961); Smith, Ann. Hum.
Genet. 32, 127-150 (1968). The value of θ at which the LOD score is the highest is
considered to be the best estimate of the recombination fraction.
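
As a minimal illustration of the calculation described above, the following Python sketch computes a two-point LOD score for the simplest case only: phase-known, fully informative meioses, where the likelihood at recombination fraction θ is θ^r (1-θ)^(n-r) for r recombinants among n meioses, and 0.5^n if the loci are unlinked. The function names and the example counts are hypothetical and are not taken from the studies described herein; programs such as LIPED and MLINK handle general pedigrees.

```python
import math


def lod_score(recombinants, meioses, theta):
    """Two-point LOD score for phase-known, fully informative meioses."""
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    nonrecombinants = meioses - recombinants
    if theta == 0.0:
        # Coincident loci cannot produce any recombinant.
        if recombinants > 0:
            return float("-inf")
        log_linked = 0.0
    else:
        log_linked = (recombinants * math.log10(theta)
                      + nonrecombinants * math.log10(1.0 - theta))
    # Unlinked loci segregate independently (theta = 0.5).
    log_unlinked = meioses * math.log10(0.5)
    return log_linked - log_unlinked


def best_theta(recombinants, meioses, steps=50):
    """Scan theta from 0 to 0.5 and return (theta, LOD) at the maximum."""
    grid = [0.5 * i / steps for i in range(steps + 1)]
    scored = ((t, lod_score(recombinants, meioses, t)) for t in grid)
    return max(scored, key=lambda pair: pair[1])


# Hypothetical example: 2 recombinants observed in 20 meioses.
theta_hat, lod_max = best_theta(2, 20)
print(f"theta = {theta_hat:.2f}, LOD = {lod_max:.2f}")
```

The grid scan mirrors the description of evaluating a series of likelihood ratios at various values of θ and taking the value with the highest LOD score as the estimate.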

Positive LOD score values suggest that the two loci are linked, whereas negative 
values suggest that linkage is less likely (at that value of 0) than the possibility that the 
two loci are unlinked. By convention, a combined LOD score of +3 or greater
(equivalent to greater than 1000:1 odds in favor of linkage) is considered definitive
evidence that two loci are linked. Similarly, by convention, a negative LOD score of -2
or less is taken as definitive evidence against linkage of the two loci being compared.
Negative linkage data are useful in excluding a chromosome or a segment thereof from
consideration. The search focuses on the remaining non-excluded chromosomal 
locations. 
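
The conventions above can be sketched in a few lines of Python. This is an illustrative sketch only: the per-family LOD values are hypothetical, and the function names are not part of any linkage package. It relies on the fact that LOD scores from independent families, evaluated at the same recombination fraction, are combined by addition.

```python
def combine_lod(family_lods):
    """Sum LOD scores from independent families at one recombination fraction."""
    return sum(family_lods)


def interpret_lod(total_lod):
    """Apply the conventional +3 / -2 thresholds to a combined LOD score."""
    if total_lod >= 3.0:
        return "linkage accepted"
    if total_lod <= -2.0:
        return "linkage excluded"
    return "inconclusive"


# Hypothetical per-family LOD scores, all computed at the same theta.
families = [1.2, 0.9, 1.1]
print(interpret_lod(combine_lod(families)))
```

A region scored as "linkage excluded" would be dropped from consideration, and the search would continue in the remaining regions.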

The invention also encompasses kits for detecting the presence of proteins or 
nucleic acid molecules of the invention in a biological sample. For example, the kit can 
comprise a compound or agent (e.g., one or more nucleic acid probes) capable of 
detecting protein or mRNA (or cDNA produced from the mRNA) in a biological sample 
or means for determining the identity of a particular nucleotide or amino acid of the 
BDNF gene or protein, respectively. For example, in one embodiment, the kit comprises 
a means for determining the identity of nucleotide 31 of the BDNF gene as numbered in
SEQ ID NO: 1. In another embodiment, the compound or agent, such as
oligonucleotide(s) or antibody(ies), is labeled. In still another embodiment, the kit
includes fluorescently labeled dideoxynucleotides. The kit can also comprise control
samples for use as standards, representing individuals homozygous for the reference or
variant nucleotide in the case of analyzing nucleic acid, or the reference or variant amino
acid in the case of analyzing proteins, or representing a heterozygous individual. The
compound or agent can be packaged in a suitable container. The kit can further 
comprise instructions for using the kit to detect protein or nucleic acid. 

The invention further pertains to compositions, e.g., vectors, comprising a
nucleotide sequence encoding a variant BDNF gene. In one embodiment, the gene
comprises BDNF sequences including position 31 of SEQ ID NO: 1.

For example, the BDNF gene or variants thereof can be expressed in an 
expression vector in which a variant gene is operably linked to a native or other
promoter. Usually, the promoter is a eukaryotic promoter for expression in a
mammalian cell. The transcription regulation sequences typically include a heterologous
promoter and optionally an enhancer which is recognized by the host. The selection of
an appropriate promoter, for example trp, lac, phage promoters, glycolytic enzyme
promoters and tRNA promoters, depends on the host selected. Commercially available
expression vectors can be used. Vectors can include host-recognized replication
systems, amplifiable genes, selectable markers, host sequences useful for insertion into
the host genome, and the like.

The means of introducing the expression construct into a host cell varies
depending upon the particular construction and the target host. Suitable means include
fusion, conjugation, transfection, transduction, electroporation or injection, as described
in Sambrook, supra. A wide variety of host cells can be employed for expression of the
variant gene, both prokaryotic and eukaryotic. Suitable host cells include bacteria such
as E. coli, yeast, filamentous fungi, insect cells, mammalian cells, typically
immortalized, e.g., mouse, CHO, human and monkey cell lines and derivatives thereof.
Preferred host cells are able to process the variant gene product to produce an
appropriate mature polypeptide. Processing includes glycosylation, ubiquitination,
disulfide bond formation, general post-translational modification, and the like.

It is also contemplated that cells can be engineered to express the BDNF allele of
the invention by gene therapy methods. For example, DNA encoding the BDNF gene
product, or an active fragment or derivative thereof, can be introduced into an expression
vector, such as a viral vector, and the vector can be introduced into appropriate cells in
an animal. In such a method, the cell population can be engineered to inducibly or
constitutively express active BDNF gene product. In a preferred embodiment, the vector 
is delivered to the bone marrow, for example as described in Corey et al. (Science 
244:1275-1281 (1989)). 

The invention further provides transgenic nonhuman animals capable of
expressing an exogenous BDNF gene and/or having one or both alleles of an
endogenous BDNF gene inactivated. Expression of an exogenous gene is usually
achieved by operably linking the gene to a promoter and optionally an enhancer, and
microinjecting the construct into a zygote. See Hogan et al., "Manipulating the Mouse
Embryo, A Laboratory Manual," Cold Spring Harbor Laboratory. Inactivation of
endogenous genes can be achieved by forming a transgene in which a cloned variant
gene is inactivated by insertion of a positive selection marker. See Capecchi, Science
244, 1288-1292 (1989). The transgene is then introduced into an embryonic stem cell,
where it undergoes homologous recombination with an endogenous gene. Mice and
other rodents are preferred animals. Such animals provide useful drug screening
systems.

The invention further relates to an oligonucleotide microarray having immobilized
thereon a plurality of oligonucleotide probes wherein at least one of said probes is
specific for the nucleotide at position 31 of SEQ ID NO: 1. In one embodiment, the
invention relates to an oligonucleotide microarray having immobilized thereon a
plurality of oligonucleotide probes wherein at least one of said probes is specific for the
variant nucleotide at position 31 of SEQ ID NO: 1. In another embodiment, the
invention relates to an oligonucleotide microarray having immobilized thereon a
plurality of oligonucleotide probes wherein at least one of said probes is specific for the
reference nucleotide at position 31 of SEQ ID NO: 1. The nucleic acid sequence
surrounding position 31 of SEQ ID NO: 1 can be used to design suitable
oligonucleotide probes, and the preparation of such oligonucleotide microarrays is well
known in the art.

The invention will be further illustrated by the following non-limiting examples. 
The teachings of the references cited herein are incorporated herein by reference in their 
entirety.



EXAMPLES 

Sample Population 

A sample population of 150 trios was initially assessed by genotyping methods for
heterozygosity with respect to the BDNF reference and variant alleles as described
herein. A trio included two parents and an offspring diagnosed as having bipolar
disorder according to the American Psychiatric Association's Diagnostic and Statistical
Manual of Mental Disorders. Of the 150 trios assessed, 98 of these trios had at least one
parent who was heterozygous for the BDNF reference and variant alleles; these 98 trios
were selected for further study, as the heterozygosity of the parent allowed a
determination of which allele the parent transmitted to the bipolar offspring. The bipolar
offspring in the trios were assessed by genotyping methods to determine which BDNF
allele had been transmitted to them by the heterozygous parent. In instances where two
parents had two offspring diagnosed with bipolar disorder, each trio (i.e., two parents
and one offspring) was considered individually.
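
The selection of informative trios can be sketched as follows. This is a hypothetical illustration, not the procedure actually used in the study: genotypes are represented as two-letter tuples of the reference (G) and variant (A) alleles, the function name is invented, and Mendelian consistency of the trio is assumed rather than checked.

```python
def transmissions(father, mother, child):
    """Alleles passed to the child by each heterozygous parent of a trio.

    Genotypes are tuples such as ("G", "A"). Homozygous parents are not
    informative and contribute nothing to the result.
    """
    def is_het(genotype):
        return len(set(genotype)) == 2

    if is_het(father) and is_het(mother) and is_het(child):
        # Parent of origin cannot be assigned, but one heterozygous parent
        # passed each allele, so the counts are one of each.
        return sorted(child)

    result = []
    for parent, other in ((father, mother), (mother, father)):
        if not is_het(parent):
            continue
        if not is_het(child):
            # Homozygous child: both parents passed the same allele.
            result.append(child[0])
        else:
            # The other parent is homozygous and supplied its only allele;
            # the remaining child allele came from this parent.
            remaining = list(child)
            remaining.remove(other[0])
            result.append(remaining[0])
    return result


# Hypothetical trios: (father, mother, affected child).
trios = [
    (("G", "A"), ("G", "G"), ("G", "G")),
    (("G", "A"), ("G", "G"), ("G", "A")),
    (("G", "G"), ("A", "A"), ("G", "A")),  # no heterozygous parent
]
informative = [t for t in trios if transmissions(*t)]
print(len(informative), [transmissions(*t) for t in informative])
```

Trios in which neither parent is heterozygous yield an empty list and drop out, which corresponds to the reduction from the trios assessed to the trios selected for further study.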



SBE-FRET Protocol 

The genotyping method used for these studies was based on single-base extension 
(SBE) and fluorescence resonance energy transfer (FRET). A locus-specific primer 
(FRET primer; 5'-GGCTGACACTTTCGAACAC (SEQ ID NO: 3)) was ordered
5'-labeled with FAM. The primer was designed so that the 3' end hybridized immediately
adjacent to the polymorphic site of interest (e.g., nucleotide 858), such that a single-base
extension of the primer would result in the addition of a nucleotide complementary to
the template polymorphic site of interest. Locus-specific primers for other positions of
interest, such as position 31 of SEQ ID NO: 1, can be readily designed and labeled with
FAM using methods well known in the art. The locus of interest was amplified and
single-base extension of the FRET primer was performed with fluorescently labeled
ddNTPs in dye-terminator sequencing fashion, except that no deoxyribonucleotides were
present. PCR primers were:

Forward PCR primer 5'-TGTAAAAGGACGGCCAGTGTTGACATG
ATTGGCTGACACT (SEQ ID NO: 4); and

Reverse PCR primer 5'-TAATACGACTGACTATAGGGGTAGAAGGT
GCGTCGTTATTGTTT (SEQ ID NO: 5).

The ddNTP corresponding to the variant base (A) was labeled with TAMRA, and 
the reference base (G) was labeled with ROX. Depending on the genotype of the 
individual, the FRET primer was extended with a ROX-labeled or TAMRA-labeled 
ddNTP. Upon incorporation of either ROX- or TAMRA-labeled ddNTPs, energy 
transfer occurs between the donor dye (FAM on the FRET primer) and the acceptor dye (the
ROX- or TAMRA-labeled ddNTP). An increase in the fluorescence intensity of one (for 
a homozygote) or both (for a heterozygote) of the acceptor dyes was used to infer the 
genotype of an individual. 

Summary of experimental procedures used in the above-described analysis:
I Amplify locus of interest
II Clean-up of PCR products with shrimp alkaline phosphatase (SAP) and
Exonuclease I (EXO)
III Single-base extension/fluorescence detection in ABI7700

Amplification of locus of interest - for 96-well plate



TABLE II: PCR MIX

Component                 Each reaction (µL)    For 96-well plate (µL)
10 mM dNTP                0.05                  5.2
10X PCR II buffer         2.0                   208
25 mM MgCl2               1.2                   125
20 µM PCR primer F        0.25                  26
20 µM PCR primer R        0.25                  26
ddH2O                     11.05                 1149
5 U/µL Amplitaq-gold      0.2                   20.8
Total                     15
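
The plate-scale volumes in the PCR mix table are consistent with multiplying each per-reaction volume by 104, i.e., 96 wells plus roughly 8% overage for pipetting loss. That factor is inferred from the table values and is not stated in the text. A small Python sketch of the scaling, with hypothetical names:

```python
# Per-reaction volumes in microliters, as read from the PCR mix table.
PCR_MIX_UL = {
    "10 mM dNTP": 0.05,
    "10X PCR II buffer": 2.0,
    "25 mM MgCl2": 1.2,
    "20 uM PCR primer F": 0.25,
    "20 uM PCR primer R": 0.25,
    "ddH2O": 11.05,
    "5 U/uL Amplitaq-gold": 0.2,
}


def scale_mix(per_reaction_ul, wells=96, extra_reactions=8):
    """Scale a per-reaction recipe to a plate, including overage.

    extra_reactions=8 is an assumption inferred from the table ratios.
    """
    factor = wells + extra_reactions
    return {name: volume * factor for name, volume in per_reaction_ul.items()}


plate_mix = scale_mix(PCR_MIX_UL)
for name, volume in plate_mix.items():
    print(f"{name:24s}{volume:8.1f} uL")
print(f"{'per-reaction total':24s}{sum(PCR_MIX_UL.values()):8.2f} uL")
```

The same helper applies to the other mixes in this protocol if their per-reaction volumes are substituted.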





Fifteen microliters of the PCR mix were added to a 96-well MJ plate. Five
microliters of genomic DNA (5 ng/µL) were added to the aliquoted PCR mix (5 µL of
1 ng/µL is often adequate).

The plate was sealed with MJ plate-seal W.

PCR was conducted using the following program:
96°C x 10 minutes
96°C x 30 seconds, 50°C x 1 minute, 72°C x 1 minute for 35 cycles
72°C x 10 minutes followed by a hold at 4°C

PCR product clean-up



TABLE III: SAP/EXO MIX

Component                              Each reaction (µL)    For 96-well plate (µL)
Shrimp alkaline phosphatase (1 U/µL)   1.0                   104
Exonuclease I (10 U/µL)                0.05                  5.2
10X SAP buffer                         1.0                   104
ddH2O                                  2.95                  306.8
Total                                  5.0



Five microliters of SAP/EXO mix were added to a clean MJ plate. Five
microliters of the PCR product were added directly to the aliquoted SAP/EXO mix. The
PCR plates were spun down and sealed with Microseal A film. The mixture was
incubated at 37°C for 45 minutes and then at 96°C for 15 minutes.

III Single-base extension/fluorescence detection in ABI7700

(The reactions were carried out in the same MJ plate used for the SAP/EXO step,
capped with 8-strip MicroAmp optical caps.)

The ddNTPs that should be incorporated in the genotyping reaction were selected. 

In this experiment, TAMRA was used to identify the variant base and ROX for the
reference base, although other possibilities exist.

TABLE IV: SBE-FRET MIX

Component                   Each reaction (µL)    For 96-well plate (µL)
FAM primer (100 µM)         0.02                  2.08
ROX ddNTP (100 µM)          0.02                  2.08
TAMRA ddNTP (100 µM)        0.02                  2.08
Thermoseq. buffer (10X)     2.0                   208
ddH2O                       7.9                   821.6
Thermosequenase (32 U/µL)   0.016                 1.7
Total                       10.0





Ten microliters of SBE-FRET mix were added to the MJ plates containing 10 µL of
SAP/EXO-treated PCR products.

• The plates were tapped on the bench to mix; they can also be spun briefly if
necessary.

• The wells were capped with optical caps. The capped wells can be rolled with a
roller if necessary.

• The plates were placed in a thermocycling detector apparatus (ABI7700).

• The plates were incubated for 6 cycles (for a 20 µL reaction) as follows:

96°C x 15 seconds
50°C x 30 seconds
60°C x 30 seconds

• Data were collected in the 60°C stage using detection settings suitable for
measuring TAMRA and ROX fluorescence.

Data were analyzed by plotting ROX fluorescence versus TAMRA fluorescence
and comparing the values between samples, control samples containing no template and
samples of known genotype. Typically, homozygous reference controls have little or
no TAMRA fluorescence, homozygous variant controls have little or no ROX
fluorescence and heterozygous controls have similar TAMRA and ROX fluorescence.
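
The comparison described above can be approximated by a simple rule on the share of the signal that comes from TAMRA. The sketch below is only an illustration under stated assumptions: the function name, the background subtraction using a no-template control, and every numeric threshold are hypothetical and would in practice be set from the known-genotype controls run on the same plate.

```python
def call_genotype(rox, tamra, background_rox=0.0, background_tamra=0.0,
                  min_signal=100.0, hom_cutoff=0.2, het_band=(0.35, 0.65)):
    """Call G/G, G/A or A/A from ROX (reference G) and TAMRA (variant A) signal.

    All thresholds are hypothetical placeholders, not values from the protocol.
    """
    # Subtract the no-template control signal and clamp at zero.
    rox_net = max(rox - background_rox, 0.0)
    tamra_net = max(tamra - background_tamra, 0.0)
    total = rox_net + tamra_net
    if total < min_signal:
        return "no call"  # too little extension product to judge
    tamra_fraction = tamra_net / total
    if tamra_fraction <= hom_cutoff:
        return "G/G"  # little or no TAMRA: homozygous reference
    if tamra_fraction >= 1.0 - hom_cutoff:
        return "A/A"  # little or no ROX: homozygous variant
    if het_band[0] <= tamra_fraction <= het_band[1]:
        return "G/A"  # similar ROX and TAMRA: heterozygote
    return "no call"  # falls between clusters


# Hypothetical fluorescence readings in arbitrary units.
for rox, tamra in [(1000, 50), (40, 900), (500, 520), (5, 5)]:
    print(rox, tamra, call_genotype(rox, tamra))
```

Samples that fall between the clusters are left uncalled rather than forced into a genotype.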

Results 

Data from the work described herein have shown that there is a variation from
random (i.e., that which would be expected by chance) in the transmission at position
858 of SEQ ID NO: 1 of the reference (G) and variant (A) alleles from an individual
parent who is heterozygous for the BDNF alleles to an offspring diagnosed with bipolar
disorder.

The data demonstrated that the variant allele (A) is transmitted less frequently (34
of 98 times) to the bipolar offspring than would be expected by chance, while the
reference allele (G) is transmitted more frequently (64 of 98 times) than would be
expected by chance (p = 0.004). In the general population (in which about 0.8 percent of
the individuals are diagnosed with bipolar disorder), the variant (A) allele occurs with a
frequency of 15 percent, while the reference allele occurs with a frequency of 85 percent.
In the sample population assessed as described herein, in which all of the individuals are
diagnosed with bipolar disorder, the variant allele occurs with a frequency of 7 percent.
Thus, it appears that the variant allele may contribute to protection or reduction in
symptomology with respect to bipolar disorder.
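
The departure from random transmission can be checked with a transmission test on the two counts. The text does not state which test produced the reported p value, so the sketch below shows two common choices (a one-degree-of-freedom chi-square statistic and an exact two-sided binomial test) and is not guaranteed to reproduce that figure. The function names are hypothetical.

```python
import math


def tdt_chi_square(transmitted, untransmitted):
    """McNemar-type chi-square (1 df) on transmission counts, with p value."""
    n = transmitted + untransmitted
    chi2 = (transmitted - untransmitted) ** 2 / n
    # Upper tail of a chi-square with 1 df equals erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


def exact_binomial_two_sided(transmitted, untransmitted):
    """Two-sided exact binomial p value against 50:50 transmission."""
    n = transmitted + untransmitted
    k = max(transmitted, untransmitted)
    upper_tail = sum(math.comb(n, i) for i in range(k, n + 1)) / 2 ** n
    return min(1.0, 2.0 * upper_tail)


# Counts from the text: reference allele passed 64 times, variant 34 times.
chi2, p_chi2 = tdt_chi_square(64, 34)
p_exact = exact_binomial_two_sided(64, 34)
print(f"chi-square = {chi2:.2f}, p = {p_chi2:.4f}, exact p = {p_exact:.4f}")
```

Both tests compare the observed split against the 49:49 split expected if a heterozygous parent passed either allele with equal probability.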

Figure 2 and Table V show data obtained from additional human samples.

TABLE V

Samples       Trans   Untrans.   p val    # trans   relative risk   95% CI     # trios
Hopkins       55      29         0.0023   84        1.90            1.25-3.0   127
U01 + NIMH    50      42         0.2021   92        1.19            0.83-1.8   155
British       38      32         0.2366   70        1.19            0.77-1.8   145
all repl.     88      74         0.1357   162       1.19            0.91-1.8   300
all BP        143     103        0.0054   246       1.39            1.1-1.8    427



"Hopkins" refers to a group of patients with bipolar disorder obtained in 
collaboration with Johns Hopkins. "U01 + NIMH" refers to a group of 155 trios, some
of which are from Johns Hopkins and some of which are from the Genetics Initiative at
the NIMH. "British" refers to 145 trios from 5 collaborators in England.

In Table V, "Trans" is the number of times the allele in question (at position 858,
in this case the reference allele) was transmitted from a heterozygous parent to a bipolar
child. "Untrans" is the number of times the other (variant) allele was passed from the
heterozygous parent to the bipolar child. The number of trios used is shown in the
column labeled "# trios" and is the number of trios for which genotypes were available.
Not all of the parents were considered to be "informative". To be included in the
analysis, the parent in question had to be a heterozygote.

The relative risk (estimated relative risk on Figure 2) is defined as the
transmission ratio in trios (i.e., # transmitted alleles/# untransmitted alleles). Under a
multiplicative disease model, this is an estimator of genotypic relative risk. The
confidence interval was calculated using a binomial distribution.
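
As a worked illustration of the transmission ratio, the sketch below computes the ratio and an approximate 95% interval for the Hopkins row (55 transmitted, 29 untransmitted). The interval method is an assumption: the text says only that a binomial distribution was used, so a Wilson score interval on the transmitted proportion, converted to a ratio, is substituted here and may differ slightly from the tabulated limits. The function name is hypothetical.

```python
import math


def transmission_ratio_ci(transmitted, untransmitted, z=1.96):
    """Transmission ratio T/U with an approximate confidence interval.

    The interval is a Wilson score interval on p = T / (T + U), mapped to
    the ratio scale by p / (1 - p). This choice of method is an assumption.
    """
    if transmitted <= 0 or untransmitted <= 0:
        raise ValueError("both counts must be positive")
    n = transmitted + untransmitted
    p = transmitted / n
    denom = 1.0 + z * z / n
    center = (p + z * z / (2.0 * n)) / denom
    half_width = z * math.sqrt(p * (1.0 - p) / n + z * z / (4.0 * n * n)) / denom
    low_p, high_p = center - half_width, center + half_width
    ratio = transmitted / untransmitted
    return ratio, low_p / (1.0 - low_p), high_p / (1.0 - high_p)


ratio, low, high = transmission_ratio_ci(55, 29)
print(f"relative risk = {ratio:.2f}, approx. 95% CI = {low:.2f}-{high:.2f}")
```

An interval whose lower limit stays above 1 indicates that the reference allele is transmitted more often than expected by chance at the chosen confidence level.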

The combination of "T" at position 31 and "A" at position 858 was tested and 
found to be in nearly complete linkage disequilibrium in both the Hopkins and the
NIMH datasets.

While this invention has been particularly shown and described with references to
preferred embodiments thereof, it will be understood by those skilled in the art that 
various changes in form and details may be made therein without departing from the 
scope of the invention encompassed by the appended claims. 



